PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG028866t1
Common NameTCM_028866
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family HD-ZIP
Protein Properties Length: 839aa    MW: 92223.8 Da    PI: 6.5422
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG028866t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox58.41.2e-181775357
                      --SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHC....TS-HHHHHHHHHHHHHHHHC CS
          Homeobox  3 kRttftkeqleeLeelFeknrypsaeereeLAkkl....gLterqVkvWFqNrRakekk 57
                      k  ++t+eq+e+Le+l++++++ps  +r++L +++    +++ +q+kvWFqNrR +ek+
  Thecc1EG028866t1 17 KYVRYTPEQVEALERLYHECPKPSSIRRQQLIRECpilsNIEPKQIKVWFQNRRCREKQ 75
                      5679*****************************************************97 PP

2START178.15.3e-561613692205
                       HHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS.SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-SEEEEEEEECTT..EEEE CS
             START   2 laeeaaqelvkkalaeepgWvkssesengdevlqkfeeskvdsgealrasgvvdmvlallveellddkeqWdetlakaetlevissg..galq 92 
                       +aee+++e+++ka+ ++  Wv+++ +++g++++ +++ s++++g a+ra+g+v  +++  v+ell+d++ W ++++++++l+v+ ++  g+++
  Thecc1EG028866t1 161 IAEETLAEFLSKATGTAVEWVQMPGMKPGPDSIGIVAISHGCTGVAARACGLVGLEPT-RVAELLKDRPSWFRDCRAVDVLNVLPTAngGTIE 252
                       7899******************************************************.9*************************9999**** PP

                       EEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--....-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXXHHHH CS
             START  93 lmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe...sssvvRaellpSgiliepksnghskvtwvehvdlkgrlphwll 181
                       l +++l+a+++l+p Rdf+ +Ry+  l++g++v++++S+ ++q+ p+    +++vRae+lpSg+li+p+++g+s +++v+h+dl+ + ++++l
  Thecc1EG028866t1 253 LLYMQLYAPTTLAPaRDFWLLRYTSVLEDGSLVVCERSLKNTQNGPSmpaVQHFVRAEMLPSGYLIRPCEGGGSIIHIVDHMDLEPWRVPEVL 345
                       **********************************************999999***************************************** PP

                       HHHHHHHHHHHHHHHHHHTXXXXX CS
             START 182 rslvksglaegaktwvatlqrqce 205
                       r+l++s+++ ++kt++a+l+++++
  Thecc1EG028866t1 346 RPLYESSTVLAQKTTMAALRQLRQ 369
                       ********************9876 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5007115.5971276IPR001356Homeobox domain
SMARTSM003891.2E-151480IPR001356Homeobox domain
SuperFamilySSF466894.71E-171680IPR009057Homeodomain-like
CDDcd000869.31E-171777No hitNo description
PfamPF000462.9E-161875IPR001356Homeobox domain
Gene3DG3DSA:1.10.10.601.6E-181975IPR009057Homeodomain-like
CDDcd146861.26E-669108No hitNo description
PROSITE profilePS5084825.437151366IPR002913START domain
CDDcd088757.97E-79155371No hitNo description
Gene3DG3DSA:3.30.530.205.4E-23160367IPR023393START-like domain
SMARTSM002343.9E-40160370IPR002913START domain
SuperFamilySSF559611.09E-37160372No hitNo description
PfamPF018521.6E-53161369IPR002913START domain
PfamPF086708.3E-53695837IPR013978MEKHLA
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0009965Biological Processleaf morphogenesis
GO:0010014Biological Processmeristem initiation
GO:0010075Biological Processregulation of meristem growth
GO:0010087Biological Processphloem or xylem histogenesis
GO:0048263Biological Processdetermination of dorsal identity
GO:0080060Biological Processintegument development
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
GO:0008289Molecular Functionlipid binding
Sequence ? help Back to Top
Protein Sequence    Length: 839 aa     Download sequence    Send to blast
MAMSCKDGKL GNLDNGKYVR YTPEQVEALE RLYHECPKPS SIRRQQLIRE CPILSNIEPK  60
QIKVWFQNRR CREKQRKEAS RLQAVNRKLT AMNKLLMEEN DRLQKQVSQL VYENGYFRQH  120
TQNATLATKD PSCESVVTSG QHHVTPQHPP RDASPAGLLS IAEETLAEFL SKATGTAVEW  180
VQMPGMKPGP DSIGIVAISH GCTGVAARAC GLVGLEPTRV AELLKDRPSW FRDCRAVDVL  240
NVLPTANGGT IELLYMQLYA PTTLAPARDF WLLRYTSVLE DGSLVVCERS LKNTQNGPSM  300
PAVQHFVRAE MLPSGYLIRP CEGGGSIIHI VDHMDLEPWR VPEVLRPLYE SSTVLAQKTT  360
MAALRQLRQI AQEVSQSNVT GWGRRPAALR ALSQRLSRGF NEALNGFTDE GWSMMGNDGM  420
DDVTILVNSS PDKLMGLNLS FANGFPSVSN AVLCAKASML LQNVPPAILL RFLREHRSEW  480
ADSSIDAYSA AAVKVGPCSL PGSRVGGFGG QVILPLAHTI EHEEFLEVIK LEGVAHSPED  540
AIMPRDVFLL QLCSGMDENA VGTCAELIFA PIDASFADDA PLLPSGFRII PLDSGKEASS  600
PNRTLDLASA LEIGPTGNKA SNDYSGNSGC MRSVMTIAFE FAFESHMQEH VASMARQYVR  660
SIISSVQRVA LALSPSHLSS HAGLRTPLGT PEAQTLARWI CQSYRLYMGV ELLKSGSEGS  720
ETILKTLWHH SDAIMCCSLK ALPVFTFANQ AGLDMLETTL VALQDITLEK IFDDHGRKTL  780
CTEFPQIMQQ GFACLQGGIC LSSMGRPVSY ERAVAWKVLN EEENAHCICF MFINWSFV*
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00200DAPTransfer from AT1G52150Download
Motif logo
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007024277.10.0Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein isoform 1
SwissprotQ9ZU110.0ATB15_ARATH; Homeobox-leucine zipper protein ATHB-15
TrEMBLA0A061GIJ30.0A0A061GIJ3_THECC; Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein isoform 1
STRINGVIT_09s0002g03740.t010.0(Vitis vinifera)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM20222678
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G52150.10.0HD-ZIP family protein
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]